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The construction of a set of transmissible gastroenteritis coronavirus (TGEV)-derived replicons as bacterial 
artificial chromosomes is reported. These replicons were generated by sequential deletion of nonessential genes 
for virus replication, using a modified TGEV full-length cDNA clone containing unique restriction sites 
between each pair of consecutive genes. Efficient activity of TGEV replicons was associated with the presence 
of the nucleoprotein provided either in cis or in trans. TGEV replicons were functional in several cell lines, 
including the human cell line 293T, in which no or very low cytopathic effect was observed, and expressed high 


amounts of heterologous protein. 


Since the etiologic pathogen causing severe acute respira- 
tory syndrome was identified as a coronavirus (6, 8, 12, 15, 16, 
18, 25), the study of coronavirus molecular biology has ac- 
quired significant relevance in order to develop effective strat- 
egies to prevent and control coronavirus infections. The design 
of antiviral drugs interfering with coronavirus replication is a 
rational approach requiring a detailed study of the replication 
mechanism at the molecular level. Coronavirus-derived repli- 
cons should be useful tools to select effective interfering mol- 
ecules. 

Transmissible gastroenteritis coronavirus (TGEV) is a mem- 
ber of the Coronaviridae family within the order Nidovirales, 
composed of enveloped single-stranded, positive-sense RNA 
viruses relevant in animal and human health (7). About two- 
thirds of the 28.5-kb TGEV genome encodes the replicase 
gene, which comprises open reading frames 1a and 1b, the last 
one being expressed by ribosomal frameshifting (23). Transla- 
tion of both open reading frames results in the synthesis of two 
large polyproteins that are processed by viral proteinases to 
yield the replicase-transcriptase complex (38). The 3’ one-third 
of the genome includes the genes encoding the structural and 
nonstructural proteins, in the order 5’-S-3a-3b-E-M-N-7-3’. 
These proteins are expressed by a discontinuous transcription 
process that most probably takes place during the synthesis of 
the negative strand, leading to the generation of a 3’ cotermi- 
nal nested set of subgenomic mRNAs, each of which has at its 
5’ end a capped leader sequence derived from the 5’ end of the 
genome (26, 39). Synthesis of subgenomic negative-sense RNA 
species is regulated by the transcription-regulating sequences 
(TRSs), which include a highly conserved core sequence that is 
found preceding each gene and at the 3’ end of the leader 
sequence (3). 

Little is known about the genome replication of coronavirus 
at the molecular level, and until recently, study was restricted 
to the analysis of defective interfering RNA genomes that are 
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amplified by a helper virus (13, 24, 34), temperature-sensitive 
mutants that are defective in RNA synthesis (27, 29), and the 
analysis of recombinant viruses generated by targeted recom- 
bination (19). The recent construction of coronavirus full- 
length cDNA clones (2, 4, 30, 35-37) provides an opportunity 
for the genetic manipulation of coronavirus genomes to study 
fundamental viral processes and to develop expression vectors. 
In fact, the generation of coronavirus self-replicating RNAs 
(replicons) provides a valuable avenue to explore the molecu- 
lar bases of coronavirus genome replication. This approach has 
been used to analyze the replication and transcription mecha- 
nism of many positive-strand RNA viruses (1, 11, 17, 20, 31, 
33). 

In this article, we report the generation and analysis of a 
collection of TGEV-derived replicons that are functional in 
several cell lines and express high amounts of heterologous 
proteins. In addition, our data indicate that the N protein is 
required either in cis or in trans for efficient coronavirus-based 
replicon activity. 

Strategy for construction of TGEV-derived replicons. The 
generation of a TGEV replicon was based in a first approach 
on published data showing that for many positive-strand RNA 
viruses, only the replicase gene was needed for autonomous 
replication of the viral RNA (9, 14, 17, 20). Accordingly, a 
cDNA that contains the 5’ and 3’ ends of the TGEV genome 
and the replicase gene was cloned as a bacterial artificial chro- 
mosome (BAC) under the control of the cytomegalovirus 
(CMV) immediate-early promoter following the same ap- 
proach as that described for the generation of a TGEV full- 
length cDNA clone (2, 10). Unfortunately, after transfection of 
swine testis (ST) cells with the replicon, using Lipofectine 
(Invitrogen), no significative replicase activity was detected 
(data not shown). One possible explanation for this lack of 
activity could be the low transfection efficiency of ST cells (less 
of 0.1%) with large-size coronavirus replicons. To test this 
possibility, the transfection efficiency of TGEV BAC clones 
was optimized. Baby hamster kidney cells stably transformed 
with the gene encoding the porcine aminopeptidase N (BHK- 
pAPN) (5) and human 293T cells (American Type Culture 
Collection) were used in the optimization process, as they are 
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transfected with efficiencies about 10- and 20-fold higher, re- 
spectively, than ST cells and because they also are permissive 
for TGEV replication. After more than 12 commercial trans- 
fection reagents were tested, Lipofectamine 2000 (Invitrogen) 
was selected for transfection of BACs containing TGEV-de- 
rived replicon cDNA clones (about 30 kb), leading to trans- 
fection efficiencies of 20 and 42% in BHK-pAPN and 293T 
cells, respectively (F. Almazan, C. Galan, and L. Enjuanes, 
unpublished data). 

After optimization of BAC transfection, a collection of 
TGEYV replicons was generated to study the roles of other viral 
genes, in addition to the replicase gene, on coronavirus repli- 
con activity. Genes 3a, 3b, and 7 are nonessential for virus 
replication in culture cells, and gene E, although essential for 
virus production, does not affect genome replication (21, 22, 
28). A modified TGEV infectious cDNA clone (TGEV-RS), in 
which overlapping genes were separated by duplication of 
TRSs and introduction of unique restriction sites at the 5’ end 
of each gene (22), was used for the construction of three 
TGEYV replicons by sequential deletion of structural and non- 
structural genes nonessential for virus genome replication: (i) 
REP 1, containing the untranslated 5’ and 3’ regions of the 
genome, the replicase, M, N, and 7 genes; (ii) REP 2, in which 
gene M was deleted; (ili) and REP 3, with genes M and N 
deleted (Fig. 1A). Gene 7 was maintained in all cases because 
it may contain necessary cis elements for genome replication 
due to its proximity to the 3’ end of the TGEV genome. 
Expression of gene 7 mRNA was used to study replicon func- 
tionality by reverse transcription-PCR (RT-PCR) analysis. Ad- 
ditionally, a multicloning site containing unique restriction 
sites, AvrII, MlulI, Swal, and Fsel, was cloned downstream 
of the replicase gene to allow cloning and expression of 
heterologous genes. The detailed cloning strategy, plasmid 
maps, and sequences are available from the authors upon 
request. 

Activity of TGEV-derived replicons in several cell lines. To 
study the functionality of TGEV replicons, human 293T cells, 
which are transfected with efficiencies over 40%, were used in 
the assay. 293T cells were grown to 95% confluence on 35- 
mm-diameter plates and transfected with 4 wg of either TGEV 
replicons, the full-length cDNA clone, or a nonreplicative 
cDNA clone, using 12 wg of Lipofectamine 2000 (Invitrogen) 
according to the manufacturer’s specifications. Total intracel- 
lular RNA was extracted at 1.5, 5, and 10 days posttransfection 
(dpt) with the RNeasy Mini Kit (QIAGEN) and used as the 
template for RT-PCR analysis of gene 7 mRNA transcription. 
RT reactions were performed with Moloney murine leukemia 
virus reverse transcriptase (Ambion) and the antisense primer 
7(213)RS (5'-TCTGTAGCAGCAAAATCC-3’), complemen- 
tary to nucleotides 199 to 216 of gene 7. The cDNAs generated 
were amplified by PCR using the reverse primer 7(213)RS and 
the forward primer SP (5’-GTGAGTGTAGCGTGGCTATA 
TCTCTTC-3’), spanning nucleotides 15 to 41 of the TGEV 
leader sequence. High levels of gene 7 MRNA were detected in 
cells transfected with REP 1 and 2 (Fig. 1B), indicating that 
both replicons are functional, at least at the transcription level. 
However, no transcript was detected in cells transfected with 
REP 3, lacking the N gene, indicating a possible role of N 
protein in replicon activity (Fig. 1B). Interestingly, gene 7 
transcripts were detected in REP 1 and 2 even after 10 dpt 
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(three cell passages), suggesting that these replicons had very 
little cytopathicity or were noncytopathic in human cells. Iden- 
tical experiments were performed with BHK-pAPN and ST 
cells, but in these cases TGEV replicons were cytopathic and 
gene 7 mRNA was detected only in the samples analyzed up to 
36 h posttransfection (hpt) but not later (data not shown). A 
quantitative analysis of gene 7 mRNA in transfected 293T, 
BHK-pAPN, and ST cells was performed by real-time RT- 
PCR. The primers used for RT (7RS, 5’-AAAACTGTAATA 
AATACAGCATGGAGGAA-3’, complementary to nucleo- 
tides 10 to 38 of gene 7) and PCR (reverse primer 7RS and the 
forward primer LDVS, 5'-CGTGGCTATATCTCTTCTTTTA 
CTTTAACTAG-3’, spanning nucleotides 24 to 55 of the 
TGEV leader sequence) were designed by using the Primer 
Express software (Applied Biosystems). SYBR Green PCR 
master mix was used in the PCR step following the manufac- 
turer’s specifications (Applied Biosystems). In 293T cells the 
amount of transcript was practically identical in the samples 
analyzed at 1.5 and 10 dpt, in contrast to those in BHK-pAPN 
and ST cells, in which transcript expression was undetectable 
after 2 dpt, reinforcing the idea that TGEV replicons are 
noncytopathic in human 293T cells. As the TGEV replication 
level was about threefold lower in 293T cells than in BHK- 
pAPN cells (data not shown), this could explain the fact that 
TGEYV replicons were cytopathic in BHK-pAPN cells and not 
in 293T cells. However, replicon toxicity could also be due to a 
cell-specific factor. Finally, the amount of the cDNA encoding 
the replicon RNA and the RNA itself were analyzed at differ- 
ent dpt. Replicon RNA showed a higher stability than the 
cDNA (data not shown), suggesting that the TGEV replicon 
RNA was also functional at the replication level. 

Role of N protein provided either in cis or in ¢rans in repli- 
con activity. To confirm whether the N protein is important for 
TGEV replicon activity, the synthesis of gene 7 mRNA was 
analyzed by RT-PCR in BHK-pAPN cells transformed with 
the Sindbis virus replicon pSINrep21 (9) expressing TGEV N 
protein. To show the specificity of N protein expression, similar 
analyses were carried out with BHK-pAPN cells and BHK- 
pAPN cells transformed with the Sindbis virus replicon alone 
or with the replicon expressing the TGEV E protein (Fig. 2A 
and B). The indicated cells were grown to 95% confluence on 
35-mm-diameter plates and transfected with 4 pg of each 
TGEV replicon, using 12 pg of Lipofectamine 2000 (Invitro- 
gen). At 36 hpt total intracellular RNA was extracted and used 
as a template for RT-PCR analysis of gene 7 mRNA following 
the procedure described above. Both, REP 1 and 2 were func- 
tional in all analyzed cells. However, in the case of REP 3 high 
levels of gene 7 MRNA were detected only in cells expressing 
TGEV N protein. These results indicated that N protein pro- 
vided either in cis or in trans is required for efficient replicon 
activity. A quantitative analysis of gene 7 mRNA in BHK- 
pAPN cells or BHK-pAPN cells expressing TGEV N protein 
transfected with TGEV replicons was performed by real-time 
RT-PCR using the primers 7RS and LDVS, described above 
(Fig. 2C). A basal activity of REP 3 was detected, but the 
replicon activity increased more than 100-fold when N protein 
was provided in cis (REP 1 and REP 2 versus REP 3 activity in 
BHK-pAPN cells) and more than 1,000-fold when N protein 
was provided in trans (REP 3 activity in BHK-pAPN cells 
expressing TGEV N protein versus REP 3 activity in BHK- 
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FIG. 1. Functional analysis of TGEV-derived replicons. (A) Genetic structures of TGEV-derived replicons. The genetic structures of the 
TGEV cDNA clone (TGEV-RS) and the replicons generated from this CDNA (REP 1, REP 2, and REP 3) are illustrated. To construct REP 1, 
a 125-bp fragment containing the restriction sites Pacl, Avr, Mlul, Swal, and Fsel, generated by PCR using two overlapping oligonucleotides, 
was cloned in TGEV-RS digested with PacI and FseI. REP 2 and REP 3 were generated from REP 1 by deletion of fragments Swal-Pmel and 
Swal-Ascl, respectively. To avoid possible interferences with the expression of heterologous genes, the TRS of the S gene, located at the 3’ end 
of the replicase gene, was eliminated by introduction of three silent point mutations in its conserved core sequence. Letters and numbers above 
the bars indicate the viral genes. L, leader sequence; UTR, untranslated region. Relevant restriction sites are indicated. The core sequence is 
underlined. (B) Functional analysis of TGEV-derived replicons by RT-PCR. Human 293T cells were mock transfected (M) or transfected with 
TGEV replicons (REP 1, 2, and 3), the full-length cDNA clone (FL-RS), or a nonreplicative cDNA clone (NR), using Lipofectamine 2000 
(Invitrogen) according to the manufacturer’s specifications. Total intracellular RNA was isolated at 1.5, 5, and 10 dpt and analyzed by RT-PCR 
with specific oligonucleotides to detect gene 7 mRNA. Duplicate RT-PCR products amplified in parallel were resolved by electrophoresis in 1% 
agarose gels. MW, molecular weight markers. 


pAPN cells). Interestingly, the activity of REP 1 and 2, carrying 
the N gene, was also increased when the N protein was pro- 


responsible for TGEV replicon activity enhancement, the ac- 
tivity of REP 3 was analyzed by real-time RT-PCR in BHK- 


vided in trans, suggesting that the N protein accumulation level 
is a critical factor or that the presence of N protein is important 
at the beginning of the replication process. 

To discard the possibility that gene N mRNA itself was 


pAPN cells transformed with the Sindbis virus replicon ex- 
pressing either gene N mRNA or the same mRNA mutated in 
order to eliminate N protein synthesis. To achieve this objec- 
tive, three different mutants were constructed, in which one, 
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FIG. 2. Effect of N protein on TGEV-derived replicon activity. 
(A) Scheme of the Sindbis virus replicon construct encoding the 
TGEV N gene. RSV Pr, Rous sarcoma virus promoter; sgRNA, sub- 
genomic RNA; nsP1-4, nonstructural proteins 1 to 4; nsP2mut, mutant 
of nonstructural protein 2; Pac, puromycin resistance gene; SV40 poly 
A, transcription-termination polyadenylation signal from simian virus 
40. (B) RT-PCR analysis of gene 7 mRNA. BHK-pAPN cells (BHK) 
or BHK-pAPN cells transformed with either the Sindbis virus replicon 
pSINrep21 alone (BHK + SIN) or pSINrep21 expressing TGEV E 
protein (BHK + SIN-E) or TGEV N protein (BHK + SIN-N) were 
transfected with TGEV replicons (REP 1, 2, and 3), using Lipofect- 
amine 2000 (Invitrogen) following the manufacturer’s specifications. 
At 36 hpt total intracellular RNA was extracted and used as the 
template for RT-PCR analysis with specific primers to detect gene 7 
mRNA. Duplicate RT-PCR products amplified in parallel were re- 
solved by electrophoresis in 1% agarose gels. MW, molecular weight 
markers. (C) Real-time RT-PCR quantification of gene 7 mRNA. The 
amount of mRNA7, expressed as relative units, was determined by 
real-time RT-PCR with specific oligonucleotides to detect gene 7 
mRNA in RNA samples isolated at 36 hpt from BHK-pAPN cells 
(BHK) or BHK-pAPN cells expressing N protein (BHK + N) trans- 
fected with either a nonreplicative cDNA clone (NR), REP 1, REP 2, 
or REP 3. REP 1-REP 2 indicates mean values from REP 1 and 2, both 
encoding the N gene. Mean values from three experiments are repre- 
sented, with standard deviations shown as error bars. 


two, or three ATGs in the N gene coding sequence were 
mutated to ATC (Fig. 3A). Expression of gene 7 mRNA was 
used to study replicon activity following the procedure de- 
scribed above. The synthesis of gene N mRNA by the Sindbis 
virus replicon was analyzed by real-time RT-PCR using the 
antisense primer 5’-CCAATAACCAATCTGTTGATCCCT- 
3’, complementary to nucleotides 187 to 219 of the N gene, and 
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the forward primer 5'-ACTTATGCTCGAGAGACTTTGTA 
CCC-3’, spanning nucleotides 146 to 171 of the N gene. A clear 
reduction in the activity of REP 3 (more than 85%) was de- 
tected in BHK-pAPN cells expressing the different gene N 
mRNA mutants. However, no differences in the expression of 
gene N mRNA between the wild type and the different mutants 
were detected, indicating that N protein itself and not its 
mRNA was responsible for TGEV replicon enhancement (Fig. 
3B). 

These data indicated that N protein is not strictly essential, 
but it plays an important role as an enhancer of coronavirus 
replicon activity. The mechanism mediating N protein en- 
hancement activity is unknown. N protein may be a critical 
component of the replication-transcription complex, or it may 
simply play a structural role to stabilize coronavirus replicon 
RNAs. This is the first time that a detailed and quantitative 
analysis of the requirement for N protein in coronavirus 
replication or transcription has been reported, although it 
was previously observed that N protein increases the rescue 
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FIG. 3. Effect of N protein and its mRNA on TGEV-derived rep- 
licon activity. (A) Schematic representation of the Sindbis virus repli- 
con encoding the different gene N mRNA mutants. The positions of 
the three ATGs in the N gene coding sequence, as well as the point 
mutations introduced, are indicated. The designations above the top 
bar are the same that those described for Fig. 2A. (B) Real-time 
RT-PCR quantification of genes 7 and N mRNAs. The amounts of 
genes 7 and N mRNAs, expressed as relative units, were determined by 
real-time RT-PCR with specific oligonucleotides in RNA samples iso- 
lates at 36 hpt from BHK cells expressing the wild-type gene NmRNA 
(BHK + N) or gene N mRNA with the first ATG mutated (BHK + 
N1), with the two first ATGs mutated (BHK + N2), or with the three 
ATGs mutated (BHK + N3) transfected with REP 3. Mean values 
from two experiments are represented, with standard deviations shown 
as error bars. 
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FIG. 4. Analysis of EGFP expression by TGEV-derived REP 2. (A) Schematic structure of the cDNA encoding the REP2-TRS3a-EGFP RNA 
with the EGFP gene under the control of gene 3a TRS. The numbers and letters inside the boxes indicate the viral genes. L, leader sequence; CMV, 
cytomegalovirus immediate-early promoter; pA, poly(A); Rz, hepatitis delta virus ribozyme; BGH, bovine growth hormone termination and 
polyadenylation sequences. (B) Flow cytometry analysis of EGFP expression in BHK-pAPN cells transfected with REP2-TRS3a-EGFP. A and B 
represent the EGFP-negative and -positive cell populations, respectively. Green fluorescence intensity is revealed in logarithmic units on the x axis. 
(C) EGFP expression analyzed by confocal microscopy of BHK-pAPN cells transfected with REP2-TRS3a-EGFP. Cell nuclei were stained with 


TOPRO 3 (Molecular Probes). 


of coronavirus from infectious RNA transcripts (4, 32, 35, 
37). 

Heterologous gene expression from TGEV-derived repli- 
cons. In order to evaluate the potential of TGEV replicons to 
express heterologous genes, the expression of enhanced green 
fluorescent protein (EGFP) by REP 2 was analyzed in BHK- 
pAPN cells. A 1,115-bp cDNA fragment containing the EGFP 
gene downstream of native gene 3a TRS, comprising 262 nucle- 
otides of the 5’ TRS, the core sequence, and 23 nucleotides of the 
3’ TRS preceding the translation initiation codon, was am- 
plified by PCR with the forward primer 5'’-GCGGATATC 
TGTTTAAACGTGTGGCTACTAATAGGCTTAGTAG-3’ 
(the restriction sites EcoRV and Pmel are underlined) and the 
reverse primer 5’-GGGGATATCGGCGCCAATATTTAA 
ATAAAATCACCATTGAG-3’ (the restriction sites EcoRV 
and SfoI are underlined), using as the template the plasmid 
pBAC-TGEV-A3ab-EGFP (a full-length TGEV cDNA clone 
expressing EGFP under the control of TRS3a). The PCR 
product (TRS3a-EGFP) was digested with restriction endonu- 
clease EcoRV and cloned into REP 2 digested with Mlul and 
treated with Klenow enzyme to generate the plasmid REP2- 
TRS3a-EGFP (Fig. 4A). BHK-pAPN cells were grown to 95% 
confluence and transfected with REP2-TRS3a-EGFP, using 


Lipofectamine 2000 (Invitrogen) as described in the previous 
section, and EGFP expression was analyzed at 36 hpt by flow 
cytometry (Fig. 4B) and confocal microscopy (Fig. 4C). A 
significative proportion of cells (18%) were transfected, and 
more than 80% of them expressed high levels of EGFP (Fig. 
4C), indicating that a majority of cells transfected with the 
replicon expressed the heterologous gene. The strategy de- 
scribed here, in which the replicon encoding the N protein was 
launched from the cell nucleus by using the CMV promoter, 
allowed heterologous protein expression levels at least 1,000- 
fold higher than those previously reported with the human 
coronavirus 229E replicon (31). 

The availability of functional replicons will significantly fa- 
cilitate the study of the coronavirus replication mechanism and 
the production of safe TGEV-derived vectors for vaccination 
and possibly gene therapy. 
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